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DNA RECOMBINATION IN EUKARYOTIC CELLS BY THE 
BACTERIOPHAGE PHIC31 RECOMBINATION SYSTEM 

5 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

This invention was made with government support under Grant No. 5335- 
21000-009-06S, awarded by the United States Department of Agriculture, Agricultural 
Research Service. The Government has certain rights in the invention. 

10 CROSS-REFERENCE TO RELATED APPLICATION 

This application claims the benefit of US Provisional Application No. 
60/145,469, filed July 23, 1999, which application is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

1 5 This invention pertains to the field of methods for obtaining specific and 

stable integration of nucleic acids into chromosomes of eukaryotes. The invention makes use 
of site-specific recombination systems that use prokaryotic recombinase polypeptides, such 
as the <t>C3 1 integrase. 

Background 

20 Genetic transformation of eukaryotes often suffers from significant 

shortcomings. For example, it is often difficult to reproducibly obtain integration of a 
transgene at a particular locus of interest Homologous recombination generally occurs only 
at a very low frequency. To overcome this problem, site-specific recombination systems 
have been employed. These methods involve the use of site-specific recombination systems 

25 that can operate in higher eucaryotic cells. 

Many bacteriophage and integrative plasmids encode site-specific 
recombination systems that enable the stable incorporation of their genome into those of 
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their hosts. In these systems, the minimal requirements for the recombination reaction are a 
recombinase enzyme, or integrase, which catalyzes the recombination event, and two 
recombination sites (Sadowski (1986) J. Bacteriol. 165: 341-347; Sadowski (1993) FASEB 
J. 7: 760-767). For phage integration systems, these are referred to as attachment (att) sites, 
5 with an attP element from phage DNA and the attB element encoded by the bacterial 

genome. The two attachment sites can share as little sequence identity as a few base pairs. 
The recombinase protein binds to both att sites and catalyzes a conservative and reciprocal 
exchange of DNA strands that result in integration of the circular phage or plasmid DNA 
into host DNA. Additional phage or host factors, such as the DNA bending protein IHF, 
10 integration host /actor, may be required for an efficient reaction (Friedman (1 988) Cell 

55:545-554; Finkel & Johnson (1992) Mol. Microbiol. 6: 3257-3265). The reverse excision 
reaction sometimes requires an additional phage factor, such as the xis gene product of phage 
X. (Weisberg & Landy (1983) "Site-specific recombination in phage lambda." In Lambda II, 
eds. Hendrix et al. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) pp.21 1-250; 
15 Landy (1989) Ann. Rev. Biochem. 58: 913-949. 

The recombinases have been categorized into two groups, the X integrase 
(Argos et al. (1986) EMBOJ. 5: 433-44; Voziyanov et al. (1999) Nucl. Acids Res. 27: 930- 
941) and the resolvase/invertase (Hatfull & Grindley (1988) "Resolvases and DNA- 
invertases: a family of enzymes active in site-specific recombination" In Genetic 
20 Recombination, eds. Kucherlipati, R., & Smith, G. R. (Am. Soc. Microbiol., Washington 
DC), pp. 357-396) families. These vary in the structure of the integrase enzymes and the 
molecular details of their mode of catalysis (Stark et al. (1992) Trends Genetics 8: 432-439). 
The temperate Streptomyces phage OC31 encodes a 68 kD recombinase of the latter class. 
The efficacy of the <DC31 integrase enzyme in recombining its cognate attachment sites was 
25 recently demonstrated in vitro and in vivo in recA mutant Escherichia coli (Thorpe & Smith 
(1998) Proc. Nat'l. Acad. Sci. USA 95: 5505-5510). The 3C31 integration reaction is simple 
in that it does not require a host factor and appears irreversible, most likely because an 
additional phage protein is required for excision. The phage and bacterial att sites share only 
three base pairs of homology at the point of cross-over. This homology is flanked by 
30 inverted repeats, presumably binding sites for the integrase protein. The minimal known 
functional size for both attB and attP is ~50 bp. 
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The CtQ-lox system of bacteriophage PI, and the FLP-FRT system of 
Saccharomyces cerevisiae have been widely used for transgene and chromosome 
engineering in animals and plants (reviewed by Sauer (1994) Curr. Opin. Biotechnol. 5: 521- 
527; Ow (1996) Curr. Opin. Biotechnol. 7: 181-186). Other systems that operate in animal 

5 or plant cells include the following: 1) the K-RS system from Zygosaccharomyces rouxii 
(Onouchi et al. (1995) Mol. Gen. Genet. 247: 653-660), 2) the Gin-git system from 
bacteriophage Mu (Maeser & Kahmann (1991) Mol. Gen. Genet. 230: 170-176) and, 3) the p 
recombinase-su system from bacterial plasmid pSM19035 (Diaz et al. (1999) J. Biol. Chem. 
27 '4: 6634-6640). By using the site-specific recombinases, one can obtain a greater frequency 

10 ofintegration. 

However, these five systems suffer from a significant shortcoming. Each of 
these systems have in common the property that a single polypeptide recombinase catalyzes 
the recombination between two sites of identical or nearly identical sequences. The product- 
sites generated by recombination are themselves substrates for subsequent recombination. 
15 Consequently, recombination reactions are readily reversible. Since the kinetics of 

intramolecular interactions are favored over intermolecular interactions, these recombination 
systems are efficient for deleting rather than integrating DNA. Thus, a need exists for 
methods and systems for obtaining stable site-specific integration of transgenes. The present 
invention fulfills this and other needs. 

20 SUMMARY OF THE INVENTION 

The present invention provides methods for obtaining stable, site-specific 
recombination in a eukaryotic cell. Unlike previously known methods for site-specific 
recombination, the recombinants obtained using the methods of the invention are stable. The 
recombination reaction is not reversible. 

25 The methods involve providing a eukaryotic cell that comprises a first 

recombination site and a second recombination site, which second recombination site can 
serve as a substrate for recombination with the first recombination site. The first and the 
second recombination sites are contacted with a prokaryotic recombinase polypeptide, 
resulting in recombination between the recombination sites, thereby forming one or two 

30 hybrid recombination sites. Significantly, the recombinase polypeptide is one that can 
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mediate site-specific recombination between the first and second recombination sites, but 
cannot mediate recombination between the two hybrid recombination sites in the absence of 
an additional phage-produced factor that is not present in the eukaryotic cell. Either or both 
of the recombination sites can be present in a chromosome of the eukaryotic cell. In some 

5 embodiments, one of the recombination sites is present in the chromosome and the other is 
included within a nucleic acid that is to be integrated into the chromosome. 

The invention also provides eukaryotic cells that contain aprokaryotic 
recombinase polypeptide or a nucleic acid that encodes a prokaryotic recombinase. In these 
embodiments, the recombinase is one that can mediate site-specific recombination between a 

10 first recombination site and a second recombination site that can serve as a substrate for 
recombination with the first recombination site, but in the absence of an additional factor 
that is not present in the eukaryotic cell cannot mediate recombination between two hybrid 
recombination sites that are formed upon recombination between the first recombination site 
and the second recombination site. In presently preferred embodiments, the cells of the 

1 5 invention include a nucleic acid that has a coding sequence for a recombinase polypeptide. 
The recombinase coding sequence is preferably operably linked to a promoter that mediates 
expression of the recombinase-encoding polynucleotide in the eukaryotic cell. The 
eukaryotic cells of the invention can be an animal cell, a plant cell, a yeast cell or a fungal 
cell, for example. 

20 In additional embodiments, the invention provides methods for obtaining a 

eukaryotic cell having a stably integrated transgene. These methods involve introducing a 
nucleic acid into a eukaryotic cell that comprises a first recombination site, wherein the 
nucleic acid comprises the transgene of interest and a second recombination site which can 
serve as a substrate for recombination with the first recombination site. The first and second 

25 recombination sites are contacted with a prokaryotic recombinase polypeptide. The 
recombinase polypeptide catalyzes recombination between the first and second 
recombination sites, resulting in integration of the nucleic acid at the first recombination site, 
thereby forming a hybrid recombination site at each end of the nucleic acid. Again, the 
recombinase polypeptide is one that can mediate site-specific recombination between the 

30 first and second recombination sites, but cannot mediate recombination between two hybrid 
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recombination sites in the absence of an additional factor that is not present in the eukaryotic 
cell. 

Additional embodiments of the invention provide nucleic acids that include a 
polynucleotide sequence that encodes a bacterial recombinase polypeptide operably linked to 
a promoter that functions in a eukaryotic cell. The recombinase polypeptides encoded by 
these nucleic acids of the invention cannot mediate recombination between two hybrid 
recombination sites that are formed upon recombination between a first recombination site 
and a second recombination site in the absence of a bacteriophage factor that is not present in 
the eukaryotic cells. In some embodiments, the nucleic acids further include at least one 
recombination site that is recognized by the recombinase polypeptide. 

Also provided by the invention are eukaryotic cells that include a 
polynucleotide that has one or more bacteriophage OC31 recombination sites, or 
recombination sites for other recombinases that cannot mediate recombination between two 
hybrid recombination sites that are formed upon recombination between a first 
recombination site and a second recombination site in the absence of a bacteriophage factor 
that is not present in the eukaryotic cells. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic (not to scale) representation of the chromosome 
structure at the S. pombe leul locus. Homologous insertion of pLT44 into the chromosome 

) (Figure 1 A) places a <DC3 1 attP target between leul alleles as shown in Figure IB. pLT43 
promoted site-specific integration of pLT45 into the chromosomal attP target leads to the 
structure shown in Figure 1C. Arrowheads indicate PCR primers corresponding to the 17 
promoter (T7), T3 promoter (T3) and ura4* coding region (U4). Predicted sizes of Xbal (X) 
cleavage products are shown. 

5 Figure 2 shows a schematic of an experiment which demonstrated that <DC3 1 

integrase catalyzes site-specific integration of a transgene encoding green fluorescent protein 
(GFP) in CHO cells. 

Figure 3 shows a schematic diagram of an experiment which demonstrated 
that <DC31 catalyzes specific recombination at an attB site to insert a hygromycin 

10 phosphotransferase gene downstream of a chromosomally located promoter. Successful 
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integration produces a Pc-attL-hpt linkage and a hygromycin resistance phenotype. The 
effect of different lengths of attP and attB sites were analyzed using the plasmids indicated. 

Figure 4 shows a schematic diagram of an experiment which demonstrates 
that <J>C31 integrase catalyzes the excision of a DNA flanked by attB and att? sites from the 
5 tobacco genome. 

Figure 5 shows a schematic diagram of an experiment in which <E>C31 
integrase was shown to catalyze integration of a transgene into the tobacco genome. 

DETAILED DESCRIPTION 

Definitions 

1 0 An "exogenous DNA segment", "heterologous polynucleotide" a "transgene" 

or a "heterologous nucleic acid", as used herein, is one that originates from a source foreign 
to the particular host cell, or, if from the same source, is modified from its original form. 
Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular 
host cell, but has been modified. Thus, the terms refer to a DNA segment which is foreign or 
1 5 heterologous to the cell, or homologous to the cell but in a position within the host cell 
nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are. 
expressed to yield exogenous polypeptides. 

The term "gene" is used broadly to refer to any segment of DNA associated 
with a biological function. Thus, genes include coding sequences and/or the regulatory 
20 sequences required for their expression. Genes can also include nonexpressed DNA 

segments that, for example, form recognition sequences for other proteins. Genes can be 
obtained from a variety of sources, including cloning from a source of interest or 
synthesizing from known or predicted sequence information, and may include sequences 
designed to have desired parameters. 
25 The term "isolated", when applied to a nucleic acid or protein, denotes that 

the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
either a dry or aqueous solution. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
30 performance liquid chromatography. A protein which is the predominant species present in a 
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preparation is substantially purified. In particular, an isolated gene is separated from open 
reading frames which flank the gene and encode a protein other than the gene of interest. 
The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band 
in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 
5 about 50% pure, more preferably at least about 85% pure, and most preferably at least about 
99% pure. 

The term "naturally-occurring" is used to describe an object that can be found 
in nature as distinct from being artificially produced by man. For example, a polypeptide or 
polynucleotide sequence that is present in an organism (including viruses) that can be 
1 0 isolated from a source in nature and which has not been intentionally modified by man in the 
laboratory is naturally-occurring. 

The term "nucleic acid" or "polynucleotide" refers to deoxyribonucleotides 
or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless 
specifically limited, the term encompasses nucleic acids containing known analogues of 
1 5 natural nucleotides which have similar binding properties as the reference nucleic acid and 
are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise 
indicated, a particular nucleic acid sequence also implicitly encompasses conservatively 
modified variants thereof (e.g. degenerate codon substitutions) and complementary 
sequences and as well as the sequence explicitly indicated. Specifically, degenerate codon 
20 substitutions may be achieved by generating sequences in which the third position of one or 
more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 
(Batzere/a/. (1991) Nucleic Acid Res. 19: 5081; Ohtsukaefc/. (1985) J. Biol. Chem. 260: 
2605-2608; Cassol et al. (1992) ; Rossolini et al. (1994) Mol Cell. Probes 8: 91-98). The 
term nucleic acid is used interchangeably with gene, cDNA, and mRNA encoded by a gene. 
25 "Nucleic acid derived from a gene" refers to a nucleic acid for whose 

synthesis a gene, or a subsequence thereof (e.g., coding region), has ultimately served as a 
template. Thus, an mRNA, a cDNA reverse transcribed from an mRNA, an RNA transcribed 
from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified 
DNA, etc., are all derived from the gene and detection of such derived products is indicative 
30 of the presence and/or abundance of the original. 
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A DNA segment is "operably linked" when placed into a functional 
relationship with another DNA segment. For example, DNA for a signal sequence is 
operably linked to DNA encoding a polypeptide if it is expressed as a preprotein that 
participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to 
5 a coding sequence if it stimulates the transcription of the sequence. Generally, DNA 

sequences that are operably linked are contiguous, and in the case of a signal sequence both 
contiguous and in reading phase. However, enhancers, for example, need not be contiguous 
with the coding sequences whose transcription they control. Linking is accomplished by 
ligation at convenient restriction sites or at adapters or linkers inserted in lieu thereof. 
10 "Plant" includes whole plants, plant organs {e.g., leaves, stems, roots, etc.), 

seeds and plant cells and progeny of same. The class of plants that can be used in the 
methods of the invention is generally as broad as the class of higher plants amenable to 
transformation techniques, including both monocotyledonous and dicotyledonous plants. 
"Promoter" refers to a region of DNA involved in binding the RNA 
15 polymerase to initiate transcription. An "inducible promoter" refers to a promoter that directs 
expression of a gene where the level of expression is alterable by environmental or 
developmental factors such as, for example, temperature, pH, transcription factors and 
chemicals. 

The term "recombinant" when used with reference to a cell indicates that the 
20 cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a 

heterologous nucleic acid. Recombinant cells can contain polynucleotides that are not found 
within the native (non-recombinant) form of the cell. Recombinant cells can also contain 
polynucleotides found in the native form of the cell wherein the polynucleotides are 
modified and re-introduced into the cell by artificial means. The term also encompasses cells 
25 that contain a nucleic acid endogenous to the cell that has been modified without removing 
the nucleic acid from the cell; such modifications include those obtained by gene 
replacement, site-specific mutation, and related techniques. 

A "recombinant expression cassette" or simply an "expression cassette" is a 
nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements 
30 that are capable of effecting expression of a structural gene in hosts compatible with such 
sequences. Expression cassettes include at least promoters and optionally, transcription 
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termination signals. Typically, the recombinant expression cassette includes a nucleic acid 

to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter. 

Additional factors necessary or helpful in effecting expression may also be used as described 

herein. For example, an expression cassette can also include nucleotide sequences that 
5 encode a signal sequence that directs secretion of an expressed protein from the host cell. 

Transcription termination signals, enhancers, and other nucleic acid sequences that influence 

gene expression, can also be included in an expression cassette. 

"Recombinase" refers to an enzyme that catalyzes recombination between 

two or more recombination sites. Recombinases useful in the present invention catalyze 
1 0 recombination at specific recombination sites which are specific polynucleotide sequences 

that are recognized by a particular recombinase. The term "integrase" refers to a type of 

recombinase. 

'Transformation rate" refers to the percent of cells that successfully 
incorporate a heterologous polynucleotide into its genome and survive. 

1 5 The term "transgenic" refers to a cell that includes a specific modification that 

was introduced into the cell, or into an ancestor of the cell. Such modifications can include 
one or more point mutations, deletions, insertions, or combinations thereof. When referring 
to an animal, the term "transgenic" means that the animal includes cells that are transgenic. 
An animal that is composed of both transgenic and non-transgenic cells is referred to herein 

20 as a "chimeric" animal. 

The term "vector" refers to a composition for transferring a nucleic acid (or 
nucleic acids) to a host cell. A vector comprises a nucleic acid encoding the nucleic acid to 
be transferred, and optionally comprises a viral capsid or other materials for facilitating entry 
of the nucleic acid into the host cell and/or replication of the vector in the host cell (e.g., 

25 reverse transcriptase or other enzymes which are packaged within the capsid, or as part of 
the capsid). 

"Recombination sites" are specific polynucleotide sequences that are 
recognized by the recombinase enzymes described herein. Typically, two different sites are 
involved (termed "complementary sites"), one present in the target nucleic acid (e.g., a 
30 chromosome or episome of a eukaryote) and another on the nucleic acid that is to be 
integrated at the target recombination site. The terms "attB" and "attP," which refer to 
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attachment (or recombination) sites originally from a bacterial target and a phage donor, 
respectively, are used herein although recombination sites for particular enzymes may have 
different names. The recombination sites typically include left and right arms separated by a 
core or spacer region. Thus, an attB recombination site consists of BOB', where B and B' are 

5 the left and right arms, respectively, and O is the core region. Similarly, attP is POP', where 
P and P' are the arms and O is again the core region. Upon recombination between the attB 
and attP sites, and concomitant integration of a nucleic acid at the target, the recombination 
sites that flank the integrated DNA are referred to as "attL" and "attR." The attL and attR 
sites, using the terminology above, thus consist of BOP' and POB', respectively. In some 

1 0 representations herein, the "O" is omitted and attB and attP, for example, are designated as 
BB' and PP', respectively. 

Description of the Preferred Embodiments 

The present invention provides methods for obtaining site-specific 
recombination in eukaryotic cells. Unlike previously known systems for obtaining site- 

15 specific recombination, the products of the recombinations performed using the methods of 
the invention are stable. Thus, one can use the methods to, for example, introduce transgenes 
into chromosomes of eukaryotic cells and avoid the excision of the transgene that often 
occurs using previously known site-specific recombination systems. Stable inversions, 
translocations, and other rearrangements can also be obtained. 

20 The invention employs prokaryotic recombinases, such as bacteriophage 

integrases, that are unidirectional in that they can catalyze recombination between two 
complementary recombination sites, but cannot catalyze recombination between the hybrid 
sites that are formed by this recombination. One such recombinase, the OC3 1 integrase, by 
itself catalyzes only the attB x attP reaction. The integrase cannot mediate recombination 

25 between the attL and attR sites that are formed upon recombination between attB and attP. 
Because recombinases such as the <1>C3 1 integrase cannot alone catalyze the reverse 
reaction, the 3>C31 attB x attP recombination is stable. This property is one that sets the 
methods of the present invention apart from site-specific recombination systems currently in 
use for eucaryotic cells, such as the Cre-fox or FLP-FRT system, where the recombination 

30 reactions can readily reverse. Use of the recombination systems of the invention provides 
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new opportunities for directing stable transgene and chromosome rearrangements in 
eukaryotic cells. 

The methods involve contacting a pair of recombination sites (e.g., attB and 
attP) that are present in a eukaryotic cell with a corresponding recombinase. The 
5 recombinase then mediates recombination between the recombination sites. Depending upon 
the relative locations of the two recombination sites, any one of a number of events can 
occur as a result of the recombination. For example, if the two recombination sites are 
present on different nucleic acid molecules, the recombination can result in integration of 
one nucleic acid molecule into a second molecule. Thus, one can obtain integration of a 
10 plasmid that contains one recombination site into a eukaryotic cell chromosome that includes 
the corresponding recombination site. Because the recombinases used in the methods of the 
invention cannot catalyze the reverse reaction, the integration is stable. Such methods are 
useful, for example, for obtaining stable integration into the eukaryotic chromosome of a 
transgene that is present on the plasmid. 
j 5 The two recombination sites can also be present on the same nucleic acid 

molecule. In such cases, the resulting product typically depends upon the relative orientation 
of the sites. For example, recombination between sites that are in the direct orientation will 
generally result in excision of any DNA that lies between the two recombination sites. In 
contrast, recombination between sites that are in the reverse orientation can result in 
20 inversion of the intervening DNA. Again, the resulting rearranged nucleic acid is stable in 
that the recombination is irreversible in the absence of an additional factor, generally 
encoded by the particular bacteriophage from which the recombinase is derived, that is not 
normally found in eukaryotic cells. One example of an application for which this method is 
useful involves the placement of a promoter between the two recombination sites. If the 
25 promoter is initially in the opposite orientation relative to a coding sequence that is to be 
expressed by the promoter and the recombination sites that flank the promoter are in the 
inverted orientation, contacting the recombination sites will result in inversion of the 
promoter, thus placing the promoter in the correct orientation to drive expression of the 
coding sequence. Similarly, if the promoter is initially in the correct orientation for 
30 expression and the recombination sites are in the same orientation, contacting the 
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recombination sites with the promoter can result in excision of the promoter fragment, thus 
stopping expression of the coding sequence. 

The methods of the invention are also useful for obtaining translocations of 
chromosomes, for example. In these embodiments, one recombination site is placed on one 
5 chromosome and a second recombination site that can serve as a substrate for recombination 
. with the first recombination site is placed on a second chromosome. Upon contacting the two 
recombination sites with a recombinase, recombination occurs that results in swapping of the 
two chromosome arms. For example, one can construct two strains of an organism, one 
strain of which includes the first recombination site and the second strain that contains the 
1 0 second recombination site. The two strains are then crossed, to obtain a progeny strain that 
includes both of the recombination sites. Upon contacting the sites with the recombinase, 
chromosome arm swapping occurs. 

Recombinases and Recombination Sites 

The methods of the invention use recombinase systems to achieve stable 
1 5 integration or other rearrangement of nucleic acids in eukaryotic cells. A recombinase 
system typically consists of three elements: two specific DNA sequences ("the 
recombination sites") and a specific enzyme ("the recombinase"). The recombinase catalyzes 
a recombination reaction between the specific recombination sites. 

Recombination sites have an orientation. In other words, they are not 
20 palindromes. The orientation of the recombination sites in relation to each other determines 
what recombination event takes place. The recombination sites may be in two different 
orientations: parallel (same direction) or opposite. When the recombination sites are present 
on a single nucleic acid molecule and are in a parallel orientation to each other, then the 
recombination event catalyzed by the recombinase is a typically an excision of the 
25 intervening nucleic acid, leaving a single recombination site. When the recombination sites 
are in the opposite orientation, then any intervening sequence is typically inverted. 

The recombinases used in the methods of the invention can mediate site- 
specific recombination between a first recombination site and a second recombination site 
that can serve as a substrate for recombination with the first recombination site. However, in 
30 the absence of an additional factor that is not normally present in eukaryotic cells, cannot 
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mediate recombination between two hybrid recombination sites that are formed upon 
recombination between the first recombination site and the second recombination site. 
Examples of these recombinases include, for example, the bacteriophage 0X31 integrase 
{see, e.g., Thorpe & Smith (1998) Proc. Nat'l. Acad. Sci. USA 95: 5505-5510; Kuhstoss & 
5 Rao (1991) /. Mol. Biol. 222: 897-890; US Patent No. 5,190,871), a phage P4 recombinase 
(Ow & Ausubel. (1983) J. Bacteriol. 155: 704-713), ^Listeria phage recombinase, a 
bacteriophage R4 Sre recombinase (Matsuura et al. (1996) /. Bacteriol. 178: 3374-3376), a 
CisA recombinase (Sato et al. (1990) J. Bacteriol. 172: 1092-1098; Stragier et al. (1989) 
Science 243: 507-512), an XisF recombinase (Carrasco et al. (1994) Genes Dev. 8: 74-83), 
10 and a transposon ln4451 TnpX recombinase (Bannam et al (1995) Mol. Microbiol. 16: 535- 
551; Crelin&Rood (1997) J. Bacteriol. 179: 5148-5156). 

Recombinase polypeptides, and nucleic acid^ that encode the recombinase 
polypeptides, are described in the art and can be obtained using routine methods. For 
example, a vector that includes a nucleic acid fragment that encodes the 3>C31 integrase is 
1 5 described in US Patent No. 5,1 90,871 and is available from the Northern Regional Research 
Laboratories, Peoria, Illinois 61604) under the accession number B- 18477. 

The recombinases can be introduced into the eukaryotic cells that contain the 
recombination sites at which recombination is desired by any suitable method. For example, 
one can introduce the recombinase in polypeptide form, e.g., by microinjection or other 
20 methods. In presently preferred embodiments, however, a gene that encodes the recombinase 
is introduced into the cells. Expression of the gene results in production of the recombinase, 
which then catalyzes recombination among the corresponding recombination sites. One can 
introduce the recombinase gene into the cell before, after, or simultaneously with, the 
introduction of the exogenous polynucleotide of interest. In one embodiment, the 
25 recombinase gene is present within the vector that carries the polynucleotide that is to be 
inserted; the recombinase gene can even be included within the polynucleotide. In other 
embodiments, the recombinase gene is introduced into a transgenic eukaryotic organism, 
e.g., a transgenic plant, animal, fungus, or the like, which is then crossed with an organism 
that contains the corresponding recombination sites. 
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Target Organisms 

The methods of the invention are useful for obtaining stable integration 
and/or rearrangement of DNA in any type of eukaryotic cell. For example, the methods are 
useful for cells of animals, plants, fungi, bacteria and other microorganisms. In some 

5 embodiments, the cells are part of a multicellular organism, e.g., a transgenic plant or 

animal. The methods of the invention are particularly useful in situations where transgenic 
materials are difficult to obtain, such as with transgenic wheat, corn, and animals. In these 
situations, finding the rare single copy insertion requires the prior attainment of a large 
number of independently derived transgenic clones, which itself requires great expenditure 

10 of effort. 

Among the plant targets of particular interest are monocots, including, for 
example, rice, com, wheat, rye, barley, bananas, palms, lilies, orchids, and sedges. Dicots are 
also suitable targets, including, for example, tobacco, apples, potatoes, beets, carrots, 
willows, elms, maples, roses, buttercups, petunias, phloxes, violets and sunflowers. Other 
15 targets include animal and fungal cells. These lists are merely illustrative and not limiting. 

Constructs for Introduction of Exogenous DNA into Target Cells 

The methods of the invention often involve the introduction of exogenous 
DNA into target cells. For example, nucleic acids that include one or more recombination 
sites are often introduced into the cells. The polynucleotide constructs that are to be 
20 introduced into the cells can include, in addition to the recombination site or sites, a gene or 
other functional sequence that will confer a desired phenotype on the cell. 

In some embodiments, a polynucleotide construct that encodes a recombinase 
is introduced into the eukaryotic cells in addition to the recombination sites. The 
recombinase-encoding polypeptide can be included on the same nucleic acid as the 
25 recombination site or sites, or can be introduced into the cell as a separate nucleic acid. The 
present invention provides nucleic acids that include recombination sites, as well as nucleic 
acids in which a recombinase-encoding polynucleotide sequence is operably linked to a 
promoter that functions in the target eukaryotic cell. 

Generally, a polynucleotide that is to be expressed (e.g., a recombinase- 
30 encoding polynucleotide or transgene of interest) will be present in an expression cassette, 
meaning that the polynucleotide is operably linked to expression control signals, e.g., 
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promoters and terminators, that are functional in the host cell of interest. The genes that 
encode the recombinase and the selectable marker, will also be under the control of such 
signals that are functional in the host cell. Control of expression is most easily achieved by 
selection of a promoter. The transcription terminator is not generally as critical and a variety 
5 of known elements may be used so long as they are recognized by the cell. 

A promoter can be derived from a gene that is under investigation, or can be a 
heterologous promoter that is obtained from a different gene, or from a different species. 
Where direct expression of a gene in all tissues of a transgenic plant or other organism is 
desired, one can use a "constitutive" promoter, which is generally active under most 
10 environmental conditions and states of development or cell differentiation. Suitable 

constitutive promoters for use in plants include, for example, the cauliflower mosaic virus 
(CaMV) 35S transcription initiation region and region VI promoters, the 1'- or 2'- promoter 
derived from T-DNA of Agrobacterium tumefaciens, and other promoters active in plant 
cells that are known to those of skill in the art. Other suitable promoters include the full- 
15 length transcript promoter from Figwort mosaic virus, actin promoters, histone promoters, 
tubulin promoters, or the mannopine synthase promoter (MAS). Other constitutive plant 
promoters include various ubiquitin or polyubiquitin promoters derived from, inter alia, 
Arabidopsis (Sun and Callis, Plant J., 11(5):1017-1027 (1997)), the mas, Mac or DoubleMac 
promoters (described in United States Patent No. 5,106,739 andbyComai et al., Plant Mol. 
20 Biol. 15:373-381 (1990)) and other transcription initiation regions from various plant genes 
known to those of skill in the art. Such genes include for example, ACT11 from Arabidopsis 
(Huang era/., Plant Mol. Biol. 33:125-139(1996)), Cat3 bom Arabidopsis (GenBankNo. 
U43147, Zhong et al., Mol. Gen. Genet. 251:196-203 (1996)), the gene encoding stearoyl- 
acyl carrier protein desaturase from Brassica napus (Genbank No. X74782, Solocombe et 
25 al., Plant Physiol, 104:1167-1 176 (1994)), GPcl from maize (GenBank No. X15596, 
Martinez et al., J. Mol. Biol 208:551-565 (1989)), and Gpc2 from maize (GenBank No. 
U45855, Manjunath et al., Plant Mol. Biol. 33:97-1 12 (1997)). Useful promoters for plants 
also include those obtained from Ti- or Ri-plasmids, from plant cells, plant viruses or other 
hosts where the promoters are found to be functional in plants. Bacterial promoters that 
30 function in plants, and thus are suitable for use in the methods of the invention include the 
octopine synthetase promoter, the nopaline synthase promoter, and the manopine synthetase 
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promoter. Suitable endogenous plant promoters include the ribulose-l,6-biphosphate 
(RUBP) carboxylase small subunit (ssu) promoter, the (a-conglycinin promoter, the 
phaseolin promoter, the ADH promoter, and heat-shock promoters. 

Promoters for use in E. coli include the T7, tip, or lambda promoters, a 
5 ribosome binding site and preferably a transcription termination signal. For eukaryotic cells, 
the control sequences typically include a promoter which optionally includes an enhancer 
derived from immunoglobulin genes, SV40, cytomegalovirus, etc., and a polyadenylation 
sequence, and may include splice donor and acceptor sequences. In yeast, convenient 
promoters include GAL1-10 (Johnson and Davies (1984) Mol. Cell. Biol. 4:1440-1448) 
10 ADH2 (Russell et al. (1983) J. Biol. Chem. 258:2674-2682), PH05 (EMBO J. (1982) 6:675- 
680), and MFa (Herskowitz and Oshima (1982) in The Molecular Biology of the Yeast 
Saccharomyces (eds. Strathem, Jones, and Broach) Cold Spring Harbor Lab., Cold Spring 
Harbor, N.Y., pp. 181-209). 

Alternatively, one can use a promoter that directs expression of a gene of 
15 interest in a specific tissue or is otherwise under more precise environmental or 

developmental control. Such promoters are referred to here as "inducible" or "repressible" 
promoters. Examples of environmental conditions that may effect transcription by inducible 
promoters include pathogen attack, anaerobic conditions, ethylene or the presence of light. 
Promoters under developmental control include promoters that initiate transcription only in 
20 certain tissues, such as leaves, roots, fruit, seeds, or flowers. The operation of a promoter 
may also vary depending on its location in the genome. Thus, an inducible promoter may 
become fully or partially constitutive in certain locations. Inducible promoters are often used 
to control expression of the recombinase gene, thus allowing one to control the timing of the 
recombination reaction. Examples of tissue-specific plant promoters under developmental 
25 control include promoters that initiate transcription only in certain tissues, such as fruit, 
seeds, or flowers. The tissue-specific E8 promoter from tomato is particularly useful for 
directing gene expression so that a desired gene product is located in fruits. See, e.g., Lincoln 
et al. (1988) Proc Nat'l. Acad. Set. USA 84: 2793-2797; Deikman et al. (19S8) EMBO J. 7: 
3315-3320; Deikmanef al. (1992) Plant Physiol. 100: 2013-2017. Other suitable promoters 
30 include those from genes encoding embryonic storage proteins. Examples of environmental 
conditions that may affect transcription by inducible promoters include anaerobic conditions, 
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elevated temperature, or the presence of light. Additional organ-specific, tissue-specific 
and/or inducible foreign promoters are also known (see, e.g., references cited in Kuhlemeier 
et al (1987) Ann. Rev. Plant Physiol. 38:221), including those 1,5-ribulose bisphosphate 
carboxylase small subunit genes of Arabidopsis thaliana (the "ssu" promoter), which are 
5 light-inducible and active only in photosynthetic tissue, anther-specific promoters (EP 

344029), and seed-specific promoters of, for example, Arabidopsis thaliana (Krebbers et al. 
(1988) Plant Physiol. 87:859). Exemplary green tissue-specific promoters include the maize 
phosphoenol pyruvate carboxylase (PEPC) promoter, small submit ribulose bis-carboxylase 
promoters (ssRUBISCO) and the chlorophyll a/b binding protein promoters. The promoter 
10 may also be a pith-specific promoter, such as the promoter isolated from a plant TrpA gene 
as described in International Publication No. W093/07278. 

Inducible promoters for other organisms include, for example, the arabinose 
promoter, the lacZ promoter, the metallothionein promoter, and the heat shock promoter, as 
well as many others that are known to those of skill in the art. An example of a repressible 
1 5 promoter useful in yeasts such as S. pombe is the Pmnt promoter, which is repressible by 
vitamin Bl. 

Typically, constructs to be introduced into these cells are prepared using 
recombinant expression techniques. Recombinant expression techniques involve the 
construction of recombinant nucleic acids and the expression of genes in transfected cells. 

20 Molecular cloning techniques to achieve these ends are known in the art. A wide variety of 
cloning and in vitro amplification methods suitable for the construction of recombinant 
nucleic acids are well-known to persons of skill. Examples of these techniques and 
instructions sufficient to direct persons of skill through many cloning exercises are found in 
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, 

25 Volume 1 52, Academic Press, Inc., San Diego, CA (Berger); and Current Protocols in 
Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1998 Supplement) 
(Ausubel). 

The construction of polynucleotide constructs generally requires the use of 
30 vectors able to replicate in bacteria. A plethora of kits are commercially available for the 
purification of plasmids from bacteria. For their proper use, follow the manufacturer's 
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instructions {see, for example, EasyPrepJ, FlexiPrepJ, both from Pharmacia Biotech; 
StrataCleanJ, from Stratagene; and, QIAexpress Expression System, Qiagen). The isolated 
and purified plasmids can then be further manipulated to produce other plasmids, used to 
transfect cells or incorporated into Agrobacterium tumefaciens to infect and transform 
plants. Where Agrobacterium is the means of transformation, shuttle vectors are constructed. 
Cloning in Streptomyces or Bacillus is also possible. 

Selectable markers are often incorporated into the polynucleotide constructs 
and/or into the vectors that are used to introduce the constructs into the target cells. These 
markers permit the selection of colonies of cells containing the polynucleotide of interest. 
Often, the vector will have one selectable marker that is functional in, e.g., E. coli, or other 
cells in which the vector is replicated prior to being introduced into the target cell. Examples 
of selectable markers for£. coli include: genes specifying resistance to antibiotics, i.e., 
ampicillin, tetracycline, kanamycin, erythromycin, or genes conferring other types of 
selectable enzymatic activities such as fi-galactosidase, or the lactose operon. Suitable 
selectable markers for use in mammalian cells include, for example, the dihydrofolate 
reductase gene (DHFR), the thymidine kinase gene (TK), or prokaryotic genes conferring 
drug resistance, gpt (xanthme-guanine phosphoribosyltransferase, which can be selected for 
with mycophenolic acid; neo (neomycin phosphotransferase), which can be selected for with 
G418, hygromycin, or puromycin; and DHFR (dihydrofolate reductase), which can be 
selected for with methotrexate (Mulligan & Berg (1 981) Proc. Nat 7. Acad. Sci. USA 78: 
2072; Southern & Berg (1982) J. Mol. Appl. Genet. 1: 327). 

Selection markers for plant cells often confer resistance to a biocide or an 
antibiotic, such as, for example, kanamycin, G 418, bleomycin, hygromycin, or 
chioramphenicol, or herbicide resistance, such as resistance to chlorsulfuron or Basta. 
Examples of suitable coding sequences for selectable markers are: the neo gene which codes 
for the enzyme neomycin phosphotransferase which confers resistance to the antibiotic 
kanamycin (Beck et al (1982) Gene 19:327); the hyg (hpf) gene, which codes for the enzyme 
hygromycin phosphotransferase and confers resistance to the antibiotic hygromycin (Gritz 
and Davies (1983) Gene 25:179); and the bar gene (EP 242236) that codes for 
phosphinothricin acetyl transferase which confers resistance to the herbicidal compounds 
phosphinothricin and bialaphos. 



18 



WO 01/07572 



PCT/USOO/19983 



If more than one exogenous nucleic acid is to be introduced into a target 
eukaryotic cell, it is generally desirable to use a different selectable marker on each 
exogenous nucleic acid. This allows one to simultaneously select for cells that contain both 
of the desired exogenous nucleic acids. 

Methods for Introducing Constructs into Target Cells 

The polynucleotide constructs that include recombination sites and/or 
recombinase-encoding genes can be introduced into the target cells and/or organisms by any 
of the several means known to those of skill in the art. For instance, the DNA constructs can 
be introduced into plant cells, either in culture or in the organs of a plant by a variety of 
conventional techniques. For example, the DNA constructs can be introduced directly to 
plant cells using Holistic methods, such as DNA particle bombardment, or the DNA 
construct can be introduced using techniques such as electroporation and microinjection of 
plant cell protoplasts. Particle-mediated transformation techniques (also known as 
"biolistics") are described in Klein et al., Nature, 327:70-73 (1987); Vasil, V. et ai, 
Bio/Technol. 1 1 :1553-1558 (1993); and Becker, D. et ai, Plant J., 5:299-307 (1994). These 
methods involve penetration of cells by small particles with the nucleic acid either within the 
matrix of small beads or particles, or on the surface. The biolistic PDS-1000 Gene Gun 
(Biorad, Hercules, CA) uses helium pressure to accelerate DNA-coated gold or tungsten 
microcarriers toward target cells. The process is applicable to a wide range of tissues and 
cells from organisms, including plants, bacteria, fungi, algae, intact animal tissues, tissue 
culture cells, and animal embryos. One can employ electronic pulse delivery, which is 
essentially a mild electroporation format for live tissues in animals and patients. Zhao, 
Advanced Drug Delivery Reviews 17:257-262 (1995). 

Other transformation methods are also known to those of skill in the art. 
Microinjection techniques are known in the art and well described in the scientific and patent 
literature. The introduction of DNA constructs using polyethylene glycol (PEG) precipitation 
is described in Paszkowski et al., EMBOJ. 3:2717 (1984). Electroporation techniques are 
described in Fromm et al., Proc. Natl. Acad. Sci. USA, 82:5824 (1985). PEG-mediated 
transformation and electroporation of plant protoplasts are also discussed in Lazzeri, P., 
Methods Mol. Biol. 49:95-106 (1995). Methods are known for introduction and expression of 
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heterologous genes in both monocot and dicot plants. See, e.g., US Patent Nos. 5,633,446, 
5,317,096, 5,689,052, 5,159,135, and 5,679,558; Weising et al. (1988) Ann. Rev. Genet. 
22:421-477. Transformation of monocots in particular can use various techniques including 
electroporation (e.g., Shimarnoto et al., Nature (1992), 338:274-276); biolistics (e.g., 
5 European Patent Application 270,356); and Agrobacterium (e.g., Bytebier et al., Proc. Nat'l 
Acad. Sci. USA (1987) 84:5345-5349). 

For transformation of plants, DNA constructs may be combined with suitable 
T-DNA flanking regions and introduced into a conventional Agrobacterium twnefaciens host 
vector. The virulence functions of the A. tumefaciens host will direct the insertion of a 
10 transgene and adjacent marker gene(s) (if present) into the plant cell DNA when the cell is 
infected by the bacteria. Agrobacterium tumefaciens-raediteted transformation techniques 
are well described in the scientific literature. See, for example, Horsch et al. Science, 
233:496-498 (1984), Fraley et al., Proc. Natl. Acad. Sci. USA, 80:4803 (1983), and 
Hooykaas, Plant Mol Biol., 13:327-336 (1989), Bechtold et al., Comptes Rendus De L 
15 Academie Des Sciences Serie Iii-Sciences DeLa Vie-Life Sciences, 316:1194-1199 (1993), 
Valvekens et al, Proc. Natl. Acad. Sci. USA, 85:5536-5540 (1988). For a review of gene 
transfer methods for plant and cell cultures, see, Fisk et al., Scientia Horticulturae 55:5-36 
(1993) andPotrykus, CIBA Found. Symp. 154:198 (1990). 

Other methods for delivery of polynucleotide sequences into cells include, for 
20 example liposome-based gene delivery (Debs and Zhu (1993) WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; Brigham 
(1991) WO 91/06309; and Feigner et al. (1987) Proc. Natl. Acad. Sci. USA 84: 7413-7414), 
as well as use of viral vectors (e.g., adenoviral (see, e.g., Berns et al. (1995) Ann. NY Acad. 
Sci. Ill: 95-104; Ah et al. (1994) Gene Ther. 1 : 367-384; and Haddada et al. (1995) Curr. 
25 Top. Microbiol. Immunol. 199 ( Pt 3): 297-306 for review), papillomaviral, retroviral (see, 
e.g., Buchscher etal. (1992) J. Virol. 66(5) 2731-2739; Johann et al. (1992)/. Virol. 66 
(5):1635-1640 (1992); Sommerfelt et al, (1990) Virol. 176:58-59; Wilson et al. (1989)7. 
Virol. 63:2374-2378; Miller et al., /. Virol. 65:2220-2224 (1991); Wong-Staal et al, 
PCT/US94/05700, andRosenburg and Fauci (1993) in Fundamental Immunology, Third 
30 Edition Paul (ed) Raven Press, Ltd., New York and the references therein, and Yu et al. 
Gene Therapy (1994) supra.), and adeno-associated viral vectors (see, West et al. (1987) 
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Virology 160:38-47; Carter et al. (1989) U.S. Patent No. 4,797,368; Carter et al. WO 
93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka (1994) J. Clin. 
Invst. 94:1351 and Samulski (supra) for an overview of AAV vectors; see also, Lebkowski, 
U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5(1 1):3251-3260; Tratscbin 
5 et al. (1984) Mol. Cell. Biol, 4:2072-2081; Hennonat and Muzyczka (1984) Proc. Natl. 
Acad. Sci. USA, 81:6466-6470; McLaughlin et al. (1988) and Samulski et al. (1989) J. 
Virol, 63:03822-3828), and the like. 

Methods by which one can analyze the integration pattern of the introduced 
exogenous DNA are well known to those of skill in the art. For example, one can extract 
10 DNA from the transformed cells, digest the DNA with one or more restriction enzymes, and 
hybridize to a labeled fragment of the polynucleotide construct. The inserted sequence can 
also be identified using the polymerase chain reaction (PCR). See, e.g., Sambrook et al, 
Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989 for descriptions of these and other suitable methods. 

15 Regeneration of Transgenic Plants and Animals 

The methods of the invention are particularly useful for obtaining transgenic 
and chimeric multicellular organisms that have a stably integrated exogenous polynucleotide 
or other stable rearrangement of cellular nucleic acids. Methods for obtaining transgenic and 
chimeric organisms, both plants and animals, are well known to those of skill in the art 
20 Transformed plant cells, derived by any of the above transformation 

techniques, can be cultured to regenerate a whole plant which possesses the transformed 
genotype and thus the desired phenotype. Such regeneration techniques rely on 
manipulation of certain phytohormones in a tissue culture growth medium, typically relying 
on a biocide and/or herbicide marker which has been introduced together with the desired 

25 nucleotide sequences. Plant regeneration from cultured protoplasts is described in Evans et 
al, Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp. 124-176, 
Macmillian Publishing Company, New York (1983); and in Binding, Regeneration of 
Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, (1985). Regeneration can also 
be obtained from plant callus, explants, somatic embryos (Dandekar et al, J. Tissue Cult. 

30 Meth., 12:145 (1989); McGranahan et al. Plant Cell Rep., 8:512 (1990)), organs, or parts 
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thereof. Such regeneration techniques are described generally in Klee et al., Ann. Rev. of 
Plant Phys., 38:467-486 (1987). 

The methods are useful for producing transgenic and chimeric animals of 
most vertebrate species. Such species include, but are not limited to, nonhuman mammals, 

5 including rodents such as mice and rats, rabbits, ovines such as sheep and goats, porcines 
such as pigs, and bovines such as cattle and buffalo. Methods of obtaining transgenic 
animals are described in, for example, Puhler, A., Ed., Genetic Engineering of Animals, 
VCH Publ., 1993; Murphy and Carter, Eds., Transgenesis Techniques : Principles and 
Protocols {Methods in Molecular Biology, Vol. 18), 1993; and Pinkert, CA, Ed., Transgenic 

10 Animal Technology : A Laboratory Handbook, Academic Press, 1994. Transgenic fish 
having specific genetic modifications can also be made using the claimed methods. See, 
e.g., Iyengar etal. (1996) Transgenic Res. 5: 147-166 for general methods of making 
transgenic fish. 

One method of obtaining a transgenic or chimeric animal having specific 

1 5 modifications in its genome is to contact fertilized oocytes with a vector that includes the 
polynucleotide of interest flanked by recombination sites. For some animals, such as mice 
fertilization is performed in vivo and fertilized ova are surgically removed. In other animals, 
particularly bovines, it is preferably to remove ova from live or slaughterhouse animals and 
fertilize the ova in vitro. See DeBoer et al., WO 91/08216. In vitro fertilization permits the 

20 modifications to be introduced into substantially synchronous cells. Fertilized oocytes are 
then cultured in vitro until a pre-implantation embryo is obtained containing about 16-150 
cells. The 16-32 cell stage of an embryo is described as a morula. Pre-implantation 
embryos containing more than 32 cells are termed blastocysts. These embryos show the 
development of a blastocoel cavity, typically at the 64 cell stage. If desired, the presence of 

25 a desired exogenous polynucleotide in the embryo cells can be detected by methods known 
to those of skill in the art. Methods for culturing fertilized oocytes to the pre-implantation 
stage are described by Gordon et al. (1984) Methods Enzymol. 101: 414; Hogan et al. 
Manipulation of the Mouse Embryo: A Laboratory Manual, C.S.H.L. N.Y. (1986) (mouse 
embryo); Hammer et al. (1985) Nature 315: 680 (rabbit and porcine embryos); Gandolfi et 

30 al. (1987) J. Reprod. Fert. 81: 23-28; Rexroad et al. (1988) / Anim. Sci. 66: 947-953 (ovine 
embryos) and Eyestone et al. (1989) J. Reprod. Fert. 85: 715-720; Camous et al. (1984) J. 
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Reprod. Fert. 72: 779-785; and Heyman ef al. (1987) TheriogenologyTl: 5968 (bovine 
embryos). Sometimes pre-implantation embryos are stored frozen for a period pending 
implantation. Pre-implantation embryos are transferred to an appropriate female resulting in 
the birth of a transgenic or chimeric animal depending upon the stage of development when 
5 the transgene is integrated. Chimeric mammals can be bred to form true germline transgenic 
animals. 

Alternatively, the methods can be used to obtain embryonic stem cells (ES) 
that have a single copy of the desired exogenous polynucleotide. These cells are obtained 
from preimplantation embryos cultured in vitro. See, e.g., Hooper, ML, Embryonal Stem 

10 Cells : Introducing Planned Changes into the Animal Germline (Modern Genetics, v. 1), 
Int'l. Pub. Distrib., Inc., 1993; Bradley et al. (1984) Nature 309, 255-258. Transformed ES 
cells are combined with blastocysts from a non-human animal. The ES cells colonize the 
embryo and in some embryos form the germ line of the resulting chimeric ammal. See 
Jaenisch, Science, 240: 1468-1474(1988). Alternatively, ES cells or somatic cells that can 

1 5 reconstitute an organism ("somatic repopulating cells") can be used as a source of nuclei for 
transplantation into an enucleated fertilized oocyte giving rise to a transgenic mammal. See, 
e.g., Wilmut et al. (1997) Nature 385: 810-813. 

EXAMPLES 

The following examples are offered to illustrate, but not to limit the present 

20 invention. 

Example 1 

The <EC31 Recombination System Functions in Schizasaccharontvces vombe 

This Example demonstrates that the Streptomyces bacteriophage OC31 site- 
specific recombination system functions in eukaryotic cells. A bacteriophage attachment site 
25 (attP) was introduced into a chromosome of Schizosaccharomyces pombe at the S. pombe 
leu I locus. This target strain was subsequently transformed with a plasmid that contains the 
bacterial attachment site (attB) linked to a ura4* selectable marker. When co-transformed 
with a second plasmid harboring the OC31 integrase gene, high efficiency transformation to 
Ura + was observed under conditions where the integrase gene was expressed. 
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Southern analysis of the integration events shows insertion of the attB-ura4 
plasmid into the attP site of the leul locus. Nucleotide sequence of the hybrid junctions 
revealed that the atlB x attP recombination reaction is precise. 

Materials and Methods 

Recombinant DNA 

Standard methods were used throughout. E. coli strain XL2-Blue {recAl 
endAl gyrA96 thi-1 hsdRl 7 supE44 relAl lac \FproAB lacR ZAM/STnl 0 (Tel*) Amy 
Cam 1 ], Strategene) served as host for DNA constructs. 

Media 

Fission yeast strains were grown on minimal medium (EMM-low glucose, 
from BiolOl) supplemented as needed with 225 mg/1 adenine, histidine, leucine or uracil. 
Minimal plates with 5-FOA (5-floroorotic acid, from Zymo Research, Inc.) were prepared 
according to Grimm et al. ((1988) Mol. Gen. Genet. 215: 81-86) and were supplemented 
with adenine, histidine, and leucine. When used, thiamine was added to 5 ug/ml. 

S. pombe with &C31 attP target 

The 84 bp <DC31 attP site (abbreviated as PP'), isolated as an^pal-Sad 
fragment from P HS282 (Thorpe & Smith (1998) Proc. Nafl. Acad. Sci. USA 95:5505-5510) 
was cloned into the same sites of the S. pombe integrating vector pJK148 (Keeney & Boeke 
(1994) Genetics 1 36:849-856) to make pLT44. This plasmid was targeted to the 5. pombe 
) leul-32 allele by bthium acetate mediated transformation with Ndel cut DNA. The recipient 
host FY527 (h- ade6-M216 his3-Dl leul-32 ura4-D18), converted to Leu + by homologous 
recombination with pLT44, was examined by Southern analysis. One Leu + transformant, 
designated FY527attP, was found to contain a single copy of pLT44. Another transformant, 
designated FY527attPx2, harbors a tandem plasmid insertion. 

5 Integrative urzf vector with (PC31 atiB site 

The S. pombe ura4* gene, excised from pTZura4 (S. Forsburg) on a 1 .8 kb 
EcoRl-BamHI fragment, was inserted into pJK148 cut with the same enzymes to create 
P LT40. The <DC31 attB site (abbreviated as BB'), isolated from pHS21 as a 500 bp Bamm- 
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Xbal fragment, was ligated into pLT40 cut with those enzymes, creating pLT42. Most of the 
leul gene was removed from pLT42 by deleting a Xhol fragment to create pLT45. This 
removed all but 229 bp of leul from pLT45 and reduced its transformation efficiency to that 
of a plasmid without any leul homology. pLT50, which has a second attB site in the same 
5 orientation immediately on the other side of ura4, was constructed by first subcloning the 
attB BamM-Sad fragment from pLT42 into pUC19, excising it with EcoRl and SaUl, and 
subsequently inserting it into pLT45 cut with EcoRl and Xhol. The second attB site in the 
final construct was sequenced once on each strand and found to be identical to the first attB 
site. 

10 Linear DNA transformation 

The attB-ura4*-attB linear DNA was prepared as an Attil-AlwNl fragment 
purified from pLT50, or as a PCR product using pLTSO as template. PCR was conducted 
using standard conditions with a T3 primer and a second primer (5' ggc cct gaa art gtt get tct 
gec 3') corresponding to the plasmid backbone of pJK148. 

15 Repressible synthesis of OC31 integrase 

The S. pombe Pmnt promoter, repressive by vitamin B 1 , was excised as a 1 .2 
kb Pstl-Sad fragment from pM0147 and inserted into the his3 + , arsl vector pBG2 (Ohi et 
al. (1996) Gene 174: 315-318) cut with the same enzymes, creating pLT41. A 2.0 kb Sad 
fragment containing the <DC31 int coding region was transferred from pHS33 (Thorpe & 

20 Smith (1998) supra.) to the Sad site of pLT41 . A clone in which the int coding region is 
oriented such that expression is under the control of Pmnt was designated pLT43. 

Molecular analyses 

Southern analysis was performed using the Genius™ system from Boehringer 
Mannheim. A 998 bp internal EcoKV fragment of leul, a 1.8 kb fragment of ura4 , and the 
25 2.0 kb <PC31 int gene were digoxigen-labeled by the random primer method and used as 
probes. Polymerase chain reaction was performed on a Perkin Elmer Cetus Gene Amp PCR 
9600 using Stratagene Turbo PFU enzyme or VENT polymerase. The standard T3 and T7 
primers were used where possible. The ura4 primer (5' gtc aaa aag ttt cgt caa tat cac 3' 
(SEQ ID NO: 1)) and the pJK148 primers were purchased from Operon Technologies. For 
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all PCR reactions an annealing temperature of 5 1°C and a 30-second extension time were 
used. 

Results and Discussion 

Inserting a target site into the S. pombe genome 

5 To create a host strain with a target site for OC3 1 -mediated integration, the 

•DC31 attP site was inserted by homologous recombination into the leul locus of the fission 
yeast genome to form the Leu + strain FY527attP (Figure 1 A). Previous studies showed that 
when S. pombe DNA is cleaved with Xbal and probed with an internal 1 kb fragment of the 
lexiC gene, the probe detects a 14 kb band (Keeney & Boeke (1994) Genetics 136: 849-856). 
10 Insertion of the leu + plasmid pJK148 at the /eal-32 locus results in detection of 3 and 18 kb 
bands (Figure 1A). Since P LT44 differs from pJK148 by the inclusion of an 84 bp 4>C31 
att? element, integration of pLT44 at leul-32 yielded the same 3 kb and 18 kb hybridization 
pattern in FY527artP. The absence of other hybridizing fragments indicates that the pLT44 
DNA resides as a single integrated copy. 

15 <PC31-integase-mediated transformation 

FY527atttP was transformed with pLT45, which harbors ura4 and an attB 
sequence (BB') but lacks an origin of replication. This construct was introduced by itself or 
with pLT43, a his3 + replicating vector that produces OC31 integrase. The inclusion of 
pLT43 increased the number of Ura + transfonnants an average of 15 fold (T able 1). This 
20 enhancement cannot be attributed to the recombination between pLT45 and the replication- 
proficient pLT43, as its effect is dependent on integrase gene expression. Transcription of 
the integrase gene is under the control oiPmnt, a promoter repressive by high levels of 
vitamin Bl (Maundrell, K. (1993) Gene 123: 127-130). The repression is not absolute 
(Forsburg, S. L. (1993) Nucleic Acids Res 21: 2955-2956) but reduces the production of 
25 integrase protein. When thiamine was added to the growth medium, the number of Ura + 
transfonnants decreased to near background level. The frequency of Ura + tranformants did 
not change significantly whether or not the integrase plasmid was co-selected by omission of 
histidine from the medium. The transformation competency of FY527atttP was estimated 
from the number of His + transfonnants obtained with pLT43 or its progenitor plasmid pBG2. 
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Compared to the frequency of either replicating plasmid, the pLT43-dependent 
transformation of FY527attP averaged about 15%. 

Table 1 : Integrase-dependent site-specific insertion in S. pombe FY527attP. 

DNA Selection Bl Transformants Relative Class a Class b Others 
(lUg) (5ug) per 10 7 cells Value 5 




pLT43 


His* 


7200 (±2200) 


100 








pLT45 


Ura + 


63 (+10) 


1 


0%* 


0%* 


100%* 


pLT45 + 




1100 (±120) 


15 


88%* 


6% f 


6%* 


pLT43 


Ura + 












pLT45 + 


Ura + 


+ 120 (±16) 


2 


0%* 


25%* 


75%* 



P LT43 



♦From three independent experiments 

'(transformation efficiency of the DNA of interest)/(transformation efficiency of pLT43) x 
100 
10 *n=16 
*n-8 

<PC31-integrase promoted attP x attB recombination 

Recombination between the P LT45-encoded <DC31 attB element and the 
chromosomally situated attP sequence would incorporate the circular DNA into the Ieul 

1 5 locus as depicted in Figure IB. If this reaction occurs, A&al-fractionated genomic DNA 

from the Ura + transformants is probed with Ieul DNA, the 3 kb band will remain unchanged, 
while the 18 kb band will increase to -23 kb (Figure 1C). Randomly selected Ura + colonies 
were examined by hybridization analysis. Of eight isolates derived from experiments where 
OC31 integrase gene expression was derepressed by the omission of tluamine, seven showed 

20 the presence of this -23 kb band. This same size band hybridized to the uraA probe. This 
contrasts with the lack of ura<\ hybridization with the parental strain, as expected from its 
ura4-Dl 8 deletion allele. One of these seven isolates showed additional bands hybridizing 
to both probes. This candidate appears to have a DNA rearrangement at the Ieul locus in 
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addition to a site-specific recombination event. The leul rearrangement was probably 
catalyzed by the operative S. pombe homologous recombination system. The remaining 
isolate had not experienced a site-specific recombination event and appeared to have gained 
uracil prototrophy by recombination between pLT45 and pLT43. Of these eight isolates, 
5 half were selected as both Ura + and His + , but no significant difference was found between 
this group and the group selected for Ura* only. 

From transformation experiments plated in the presence of vitamin Bl, an 
equal number of Ura + transformants was examined by DNA hybridization. The thiarnine- 
repressible Pnmt promoter is expected to limit integrase production, and thereby site-specific 
1 0 integration. Two of the eight Ura* candidates isolated from this low frequency 

transformation showed a band of 23 kb hybridizing to leul and to the ura4 probe. However, 
since both probes detected an additional band, they do not represent correct integration 
events, and we grouped them as class b integrants. In the other six isolates, the hybridization 
patterns are difficult to interpret. In some of them, the 3 kb band was not detected by the 
1 5 leul probe, as though the locus has experienced some rearrangement. In many of them, the 
weak hybridization to ura4 suggests that the Ura + phenotype may not be due to the stable 
maintenance of pLT45 in the genome. 

To ascertain the proportion of transformants maintaining the integrase 
plasmid in the absence of selection, the blots were re-probed with the integrase gene 
20 sequence. Those selected as Ura + His* would be expected to maintain the plasmid, and did 
so, as the hybridization revealed. Five of the eight isolates selected as Ura* without regard to 
the His phenotype also gave bands hybridizing to the integrase probe. To confirm that loss 
of int would not affect stable integration, another set of randomly chosen Ura cells were 
grown non-selectively for a number of generations and screened for His progeny that have 
25 lost pLT43. The analysis of eight representative Ura + His" clones showed that all had a 
single copy of pLT45 precisely integrated at the chromosome-situated attP site. The DNA 
of these integrants did not hybridize with the integrase probe. In contrast, the background 
frequency Ura + clones derived by transformation of pLT45 alone gave the parental 
configuration of hybridizing bands at the leul locus and additional faint bands at 5 kb and 7 
30 kb. These observations are consistent with either integration of pLT45 elsewhere in the 
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genome, or maintenance of the plasmid in some cells despite the lack of a S. pombe 
replication origin. 

Conservative site-specific recombination 

PCR was used to retrieve the attPlattB recombinant junctions from three 
representative Ura + candidates. One of the hybrid sites, attR (PET) would be flanked by T3 
and T7 promoters; the other site, attL (BP') by the T3 promoter and ura4 DNA (Figure 1C). 
In each case, primer pairs directed to these sequences amplified a band of the expected size 
while the original attP (PP') was no longer found. This contrasts with the parental strain 
FY527attP, where attP, but neither attL nor attR, was detected. The nucleotide sequence of 
three representative attL and attR PCR products showed the absence of accompanying 
mutations. Hence, as in bacteria and mammalian cells, <DC31 mediated site-specific 
recombination in S. pombe is a conservative recombination reaction. 

<PC31 integrase does not excise integrated molecules 

Thorpe and Smith ((1998) Proc. Natl. Acad. Sci. USA 95: 5505-5510) did not 
; detect reversal of the <DC3 1 integrase reaction by analysis of gel-fractionated DNA 

fragments. We examined the possibility of a reverse reaction through a genetic selection 
strategy. The precise integration of pLT45 into FY527attP was confirmed for three clones 
by Southern analysis; these strains were then re-transformed with pLT43. Excision of 
pLT45 would result in loss of the ura4 + marker, the Ura phenotype can be scored on plates 
0 with 5-FOA (Grimm etal. (1988) Mol. Gen. Genet. 215: 81-86). The frequencies of Ura" 
segregants from cultures of the three Ura + His progenitors were 5.7 x lO" 4 , 7.1 x 10" 4 and 5.6 
x 10 4 . In contrast, the frequencies of Ura - colonies from the three Ura + His + derivatives were 
somewhat higher: 1.1 x 10" 2 , 3.8 x 10" 3 and 2.3 x 10" 3 , respective 19-, 5- and 4-fold increases. 
When a control vector lacking the integrase gene, pBG2, was used instead, increased rates 
:5 of 5-FOA resistance were also found: l.Ox 10" 2 , 1.0 x 10' 2 , and 8.0 x 10°, respectively. The 
transformation process itself appears mutagenic. 

Three Ura" His + clones from each of the three cultures that had been 
transformed by pLT45 were analyzed by Southern blotting. One isolate had a DNA pattern 
consistent with stable integration of pLT45 into FY527attP. Therefore, in this clone, the 
}0 Ura' phenotype was caused by a mutation that did not appreciably alter the restriction 
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pattern, rather than by reversal of the site-specific recombination reaction. The second clone 
showed a Southern pattern characteristic of FY527attP lacking apLT45 insertion, the third 
had a pattern consistent with a mixture of two types of cells, those like FY527attP without a 
P LT45 insertion, and those like the FY527attP progenitor strain FY527. The latter structure 

5 could arise from intrachromosomal homologous recombination between the leu repeats, 
reversing the insertion of P LT44 (Figure M). If precise excision of the integrated plasmid 
DNA occurred in the latter two candidates, the attP site would be regenerated; this would be 
detectable with PCR. The size of the PCR product was that expected for an intact hybrid 
site, the presence of the hybrid site was confirmed by sequencing the PCR product. These 

10 observations are consistent with the idea that deletion of the uraA gene occurred by some 
mechanism other than OC31 -mediated excision. 

Summary 

The integration of a circular molecule at a single target site was an efficient 
process yielding precise insertions in nearly all transformants. The few aberrant events we 

1 5 observed are probably largely attributable to the S. pombe recombination system acting on 
the leul repetitive DNA. "When integrase production was limited through the repression of 
its promoter, the number of transformants was reduced to near background level. Under 
these conditions, few of the recovered transformants were derived from <DC31 site-specific 
recombination. Functional operation of the cpC31 site-specific recombination system in 

20 eukaryotic cells presents new opportunities for the manipulation of transgenes and 

chromosomes. The OC3 1 system can be used with selective placement of attB and attP site 
to delete, invert or insert DNA. An important feature of this system is that the attB x attP 
reaction is irreversible in the absence of an excision-specific protein. 

Example 2 

2 5 The (PC31 Integrase Functions in CHO Cells to C reate Stable Integration 

This Example describes an experiment in which the OC3 1 integrase was 
tested for ability to mediate recombination between attB and attP recombination sites in 
Chinese hamster ovary (CHO) cells. 
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Methods 

The CHO cell line 51YT21 1 was transfected with the attP-containing plasmid 
pFYl, which included a selectable marker that confers zeocin resistance (Figure 2). After 
being single colony purified twice, six zeocin resistant cell lines were isolated. Analysis by 
Southern DNA hybridization confirmed that each of the six cell lines had at least one 
molecule of pFYl integrated into the genome. 

Each of the six cell lines was transfected with the ar/5-containing plasmid 
P FY9 and the inr-containing plasmid P FY6 to test for site-specific recombination between 
the attB sites on pFY9 and the attP site on the chromosomal copy of pFYl. As control, the 
same cell lines were transfected with P FY9, but without the mr-contaiinng pFY6. The P FY9 
plasmid included a neomycin resistance selectable marker under the control of an SV40 
early promoter, as well as a green fluorescent protein (GFP) coding sequence that is not 
linked to a promoter. Site-specific recombination would thus be expected to place the GFP 
coding sequence under the control of a human cytomegalovirus promoter that was included 
in pFYl, resulting in expression of GFP. 



Transfection results: Neomycin resistant colonies were placed under the 
microscope to observe whether the GFP gene is active. A large percentage of the cells 
transfected with P FY9+pFY6, but only a few of the cells transfected with the pFY9 alone 
showed GFP activity. This is consistent with site-specific integration of pFY9 when co- 
transfected with P FY6, and random insertion of pFY9 in the absence of a co-transfected int 
gene. 

PGR analysis was conducted using a primer set that corresponds to Pc 
(human cytomegalovirus promoter) and GFP (Figure 2). These primers would be expected 
to amplify a band of -0.6 kb corresponding to the integration junction. As neomycin 
resistant colonies could arise from both site-specific integration and random integration, and 
that the GFP marker does not confer a selectable trait, it was difficult to obtain pure cultures 
of integrant clones. Therefore, pools of neomycin resistant cells from each transfected line 
were subjected to PCR analysis to examine if the integration junction were present among 
the neomycin resistant cells. A band of the expected size of -0.6 kb was obtained from two 
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lines. This indicates that the attB x attP recombination junction has formed linking Pc with 
GFP. 

Example 3 

fl>r^1 Intense Ca talyzes Site-Spe rific Recombination in CHO Cells 

This Example describes a second experiment in which the <DC31 integrase 
was tested for ability to integrate a DNA molecule into the chromosome of Chinese hamster 
ovary (CHO) cells through the recombination between attP and attB sites. 



Plasmid constructs 

rhmmosomal nttR target con structs dFY12 nFY14 and pFY15 
The plasmid pcDNA3. 1 /His/lacZ (Invitrogen) was used as a vector backbone. 
A synthetic oligonucleotide contained different length of the attB site, flanked by HintM 
and Kpnl sites, was inserted between the HmWi (AAGCTT) and Kpnl (GGTACC) sites of 
pcDNA3.1/ffis/lacZ. 

The plasmid P FY12 contains 90 bp of the attB sequence (AAGCTT 
gacggtctcg aagccgcggt gcgggtgcca gggcgtgccc ttgggctccc cgggcgcgtactccacctcacccatctggt 
ccatcatgat GGTACC) (SEQ ID NO: 2). 

The plasmid P FY14 contained 50 bp of the attB site (AAGCTT gcgggtgcca 
gggcgtgccc ttgggctccc cgggcgcgta ctccacctca TGGTACC) (SEQ ID NO: 3). 

The plasmid pFY15 contained 30 bp of attB (AAGCTT ccagggcgtg 
cccttgggct ccccgggcgc ATGGTACC) (SEQ ID NO: 4). 

Inte grating attP plasmids dFY 17. pFY19. pFY20 

The hpt gene encoding for resistance to hygromycin, obtained as a 1.6 kb 
BamW to Kpnl fragment from pEDl 13, was inserted between the BamYO. and Kpnl sites of 
pBluescript II SK to generate the control plasmid pBSK-hpt. 

A synthetic oligonucleotide containing different lengths of the attP site was 
inserted between Sad (GTCGAQ and BamHl (GGATCQ sites in pBSK-hpt to generate the 
following plasmids: 
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a) The plasmid pFYl 7 contains 90 bp ( GAGCTC-g aagcggttt tcgggagtag- 
tgccccaact ggggtaacct ttgagltctc tcagttgggg gcgtagggtc gccgacatga cacaaggggt-GGATCC) of 
attP site (SEQ ID NO: 5). 

b) The plasmid pFY19 contains 50 bp of attP site (GAGCTC-tgccccaact 
ggggtaacct ttgagttctc tcagttgggg gcgtagggtc-GGATCQ (SEQ ID NO: 6). 

c) The plasmid pFY20 contains 32 bp of attP site (GAGCTC-actggggtaa 
cctttgagtt ctctcagttgjgATCC) (SEQ ID NO: 7) is called P FY20. 

Tntefl rase expressi ng construct pFY6 

An EcdRl to BamM fragment containing the nearly complete open reading 
frame of the integrase gene was inserted between the £coRI and Bamm sites of 
pcDNA3.1/Zeo(-) (Invitrogen). A synthetic oligonucleotide (GGGCCCGCCACGATGACA 
CAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCG 
CGCGAGCGCGAGAATTC) (SEQ ID NO: 8) containing a Kozack sequence and the N- 
terminal amino acid coding sequences of the integrase gene was subsequently inserted 
between the Apal and EcdRl sites to reconstruct the open reading frame. This orientation 
places a complete integrase coding region under the control of the CMV (human 
cytomegalovirus) promoter in pcDNA3.1/Zeo(-). 

Transfection protocol 

The CHO cell line K-l was transfected with attB target constructs pFY12, 
) pFY14 or pFY15 (Figure 3). These plasmids harbor the selectable marker for neomycin 
resistance, and an attB site of various lengths located between Pc (human cytomegalovirus 
promoter) and the lacZ coding region. Plasmids pFY12, pFY14 and P FY15 contain, 
respectively, 90, 50 and 30 bp of the attB sequence. Neomycin-resistant cell lines were 
obtained from consecutive purification of single colonies. Four lines of each construct were 
5 used for integration experiments. 

Each of the 1 2 lines was transfected with pFY6, a <DC3 1 integrase expression 
plasmid, along with an integration vector, pFY17, P FY19, or P FY20. The plasmids P FY17, 
pFY19 and P FY20 harbor an attP sequence of lengths 90, 50 and 32 bp, respectively. The 
attP sequence is situated upstream of the hpt open reading frame, which encodes 
SO hygromycin phosphotransferase, an enzyme that confers resistance to hygromycin. There is 
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no promoter upstream of the attP-hpt segment and hpt is therefore not expressed unless the 
plasmid integrates into the genome in such a way that the hpt coding region fuses with a 
genomic promoter. For control, pBSK-hpt was used to monitor the frequency of promoter 
fusion to hpt. The plasmid pBSK-hpt is identical to pFY17, pFY19, and pFY20 except it 
5 lacks an attP sequence. The recombination between attP and attB sites is expected to insert 
the integration vector into the chromosome target to generate a Pc-attL-hpt linkage. 
Expression of hpt will confer resistance to hygromycin. 

Results 

Transfection results: Hygromycin resistant colonies were scored for each 
10 integration plasmid which was transfected into the 12 cell lines (Table 2). From lxlO* cells 
plated, pBSK-hpt transfections failed to produce a significant number of resistant colonies. 
This indicates that the frequency of the hpt coding region fusing to a genomic promoter is 
extremely low. In contrast, pFY17, pFY19 and pFY20 yielded up to a thousand fold higher 
number of hygromycin resistant colonies, depending on the particular integration plasmid 
15 and the particular cell line. Higher numbers of hygromycin resistant colonies were produced 
from the transfection of pFY19 or P FY17 into FY12 lines. This indicates that the 
recombination between longer attB and attP sequences is more efficient than the 
recombination between shorter attB and attP sites. 

PCR was used to detect the expected -0.8 Kb junction band from 
20 representative colonies. Primers corresponding to the human cytomegalovirus promoter and 
the hpt coding region amplified a PCR product of the expected size (0.8 kb). This indicates 
that Pc is linked to the hpt coding region, consistent with recombination between the 
genomic attB site and the plasmid attP sequence. 

Example 4 

25 Tha (DC31 Integra Functions in Plant Chromosomes to Recombine attP and attB Sites 
This example describes an experiment in which the «DC31 integrase was 
tested for ability to recombine attP and attB sites that are present in a plant chromosome. 
The constructs and strategy for this experiment are shown in Figure 4. 



30 



34 



WO 01/07572 



PCT/US00/19983 



Table 2 

Number of hygromycin resistant colonies per lxlO 6 transfected cells. 





Integration plasmids 




Target cell lines 


pFY17 
(90bpAttP) 


pFY19 
(50 bp AttP) 


pFY20 
(32 bp AttP) 


pBSK-hpt 
(no attP site) 


(attB-90) 
FY12-1 
FY12-2 
FY12-3 
FY12-4 


256 
976 
246 
964 


856 
896 
976 
896 


275 
185 
114 
327 


0 
0 
2 
0 


[attB-50) 
FY14-3 
FY14-7 
FY14-8 
FY14-9 


23 
96 
21 
89 


240 
245 
135 
255 


45 
67 
49 
78 


0 
0 
2 
1 


(attB-30) 

FY15-1 

FY15-2 

tY15-3 

FY15-4 


0 
0 
55 
0 


24 
345 
455 
0 


0 
34 
23 
0 


0 
0 

1 

0 



The construct pWP29 contains the fragment consisting of 35S-attP-npt-attB- 
gus, flanked by RB and LB, where 35S is the cauliflower mosaic virus promoter, npt is the 
coding region for neomycin phosphotransferase, and gus is the coding region for 
glucuronidase. RB and LB are the right and left Agrobacterium T-DNA border sequences, 
respectively. The attP site between 35S and npt serves as a rion-translated leader sequence. 
Transcription of npt by 35S confers resistance to kanamycin. The gus coding region is not 
transcribed due to the lack of an upstream promoter. 

A second construct used for plant transformation is P WP24. This construct 
contains the fragment Pnas-npt-35S-int, flanked by RB and LB, where Pnos is the nopaline 
synthase promoter, and int is the <DC31 integrase coding region. Both npt and int are 
transcribed from their respective upstream promoters. 
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If the two constructs were present in the same genome, the expression of int 
from the pWP24 bearing chromosome would be expected to produce functional OC31 
integrase to catalyze the recombination between attB and attP sites situated on the pWP29- 
bearing chromosome. The recombination event would be expected to delete the npt gene 
5 from the pWP29 construct and fuse 3SS to gits. The resulting configuration would be 35S- 
attR-gus, where attR is a hybrid site formed by the recombination between attP and attB, 
also designated as PB' (Figure 4). The deletion of npt brings gus under the transcription of 
35S and would be expected to yield plants with GUS enzyme activity. This activity can be 
detected through histochemical staining of the plant tissue. 

10 Results 

A transient expression assay was conducted to determine whether pWP29 
was functional for recombination. Through the biolistics-mediated delivery of naked DNA, 
pWP29 was cointroduced with pWP8 into maize BMS cells. The construct pWP8 has the 
integrase gene fused behind the maize ubiquitin promoter for expression in monocot cells. 
1 5 Blue spots were observed when both plasmids were co-introduced, but were not found if 
only one of the plasmids was used. This indicated that site-specific recombination took 
place in maize cells and that the attP and attB sites in pWP29 were functional sites. 

Kanamycin resistant tobacco plants were regenerated by Agrobacterium- 
mediated transformation using pWP29 or P WP24. Another transient expression assay was 
20 conducted to determine whether the pWP24 lines produced functional integrase. The 

construct pWP29 was introduced into the pWP24 plants through biolistics mediated delivery 
of naked DNA. Cells that take up the pWP29 DNA would be expected to express GUS 
enzyme activity as a result of the formation of a 35S-attR-gus configuration. Indeed, two 
lines, 24.3 and 24.4 yielded blue spots consistent of functional integrase-mediated site- 
25 specific recombination between the attP and attB sites. 

These two pWP24 integrase lines were crossed to pWP29 tester lines to 
produce progeny with the chromosomes carrying pWP29 and pWP24 in the same genome. 
Table 3 summarizes the results from the genetic crosses between integrase (24.3, 24.4) and 
tester lines (29.2, 29.4,29.5, 29.19). In each case, representative progeny seedlings were 
30 germinated in the absence of selection and histochemicaUy stained for GUS enzyme activity. 
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The table lists the number of progeny that stained blue. As the primary transformed pWP24 
and pWP29 lines are hemizygous for their respective transgene, only a quarter of the 
progeny would be expected to carry both transgene types. The sample sizes were small, so 
an apparent deviation from the expected frequency is not unusual. 

Table 3: Progeny that showed gus expression from histochemical staining. 



Male Donor plant Female Recipient 
line plant line 


Number of 
progeny stained 
for gus activity 


Number of 
progeny that 
show gus 
activity. 


% positive for 
gus activity 


24.3+ 


29.2 


38 


11 


29% 


29.2 


24.3+ 


38 


1 


2.6% 




29.4 


24.3+ 


18 


3 


16% 




24.3+ 


29.5 


38 


4 


10% 


29.5 


24.3+ 


26 


0 


0 




24.4 


29.2 


38 


7 


19% 


29.2 


24.4+ 


38 


7 


19% 




29.4 


24.4+ 


19 


8 


42% 




24.4+ 


29.5 


38 


17 


45% 


29.5 


24.4+ 


20 


6 


30% 




29.19 


24.4+ 


18 


7 


39% 



The intensity of staining varied depending on the combination of lines used as 
10 parental lines. Those with progeny with a greater proportion of the tissue staining blue 

indicate that the recombination event was more efficient. Conversely, those yielding progeny 
with less uniform staining indicate that the recombination event was less efficient. This 
variation among the different progeny pools is probably due to effects caused by the position 
of integration of the transgenes. Of the two integrase lines, 24.4 appears more efficient in 
15 promoting site-specific recombination. This is probably due to a higher level of int gene 

expression. Staining patterns produced by crossing 24.4 to 29.4 and 29.19 are consistent with 
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the experimental design that int promoted site-specific recombination of attB and attP results 
in the activation of gus gene activity. 

Example 5 

esm Tntegra s* Catalyzes Integ r at i o n o f a C i rcular Plasmid into a Plant Chromosome 

This example describes an experiment in which the OC31 integrase was 
tested for ability to insert a circular plasmid molecule into the plant chromosome through 
attP x attB site-specific recombination. This experiment is diagrammed in Figure 5. 



The target construct pWP6 contains the fragment consisting of 3SS-attP-npt, 
flanked by RB and LB. The attP site between 35S and npt serves as a non-translated leader 
sequence. Transcription of npt by 35S confers resistance to kanamycin. 

The integrating construct pYJC43 has the fragment attB-hpt, where hpt codes 
for resistance to hygromycin. The integrase expression construct is P YJC41, in which 35S 
transcribes int. 

The target construct pWP6 was placed into a plant chromosome through 
random integration of pWP6 DNA. Kanamycin resistant plants harboring a single copy of 
the pWP6 transgene are then subsequently transformed with pYJC43 and pYJC41. The 
transient expression of int from pYJC41 was expected to catalyze the recombination 
between the attB site of pYJC43 and the chromosomally-situated attP site of the pWP6 
transgene. The specific recombination between attB and attP sites would insert the pYJC43 
circular molecule into the chromosome to generate a 35S-attL-hpt linkage. Note that 
because the attP and attB sites are depicted in the inverted orientation, the attL site will 
likewise be in an inverted orientation, or designated P'B, the same as BP' in the drawn in an 
inverted orientation. A functional 35S-attL-hpt linkage would confer a hygromycin 



Kanamycin resistant tobacco plants harboring pWP6 were obtained through 
Agrobacterium-mediated transformation. Southern hybridization analysis detected one line 
that harbors a single copy of the pWP6 transgene. Progeny from this line, WP6.1, were 
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germinated aseptically and protoplasts were made from these plants. The protoplasts were 
transformed by the combination of pYJC43 and P YJC41 DNA by the polyethylene glycol 
method for direct DNA transformation. The protoplasts were then imbedded into agarose 
and cultured to form calli in the presence of hygromycin. The rate of callus formation in the 
absence of hygromycin selection was 4 x 1(K This is about 10 fold lower than usual, but is 
within the range of variability observed in protoplast transformation experiments. In the 
presence of hygromycin selection, the rate of callus formation was 7 x lCr*. This indicates 
that about 1 8% of the calli that regenerated from protoplasts contained the integration vector 
at the target site. When the integrase construct pYJC41 was excluded from the 
transformation, the rate of callus formation was <1 x 10-5. jhe higher frequency of 
hygromycin resistant calli produced by inclusion of the integrase expressing plasmid 
pYJC41 is consistent with the integrase promoted site-specific integration of pYJC43 into 
the chromosomal attP target. 

It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by reference for all purposes. 
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WHAT IS CLAIMED IS : 

1 1 . A eukaryotic cell that comprises a prokaryotic recombinase polypeptide 

2 or a nucleic acid that encodes a prokaryotic recombinase, wherein the recombinase can 

3 mediate site-specific recombination between a first recombination site and a second 

4 recombination site that can serve as a substrate for recombination with the first 

5 recombination site, but in the absence of an additional factor that is not present in the 

6 eukaryotic cell cannot mediate recombination between two hybrid recombinase 

7 recombination sites that are formed upon recombination between the first recombination site 

8 and the second recombination site. 

1 2. The eukaryotic cell of claim 1 , wherein the recombinase is selected 

2 from the group consisting of a bacteriophage OC3 1 integrase, a coliphage P4 recombinase, a 

3 Listeria phage recombinase, a bacteriophage R4 Sre recombinase, a CisA recombinase, an 

4 XisF recombinase, and a transposon Tn44Sl TnpX recombinase. 

1 3. The eukaryotic cell of claim 1, wherein the recombinase is a 

2 bacteriophage 0X31 integrase. 

1 4. The eukaryotic cell of claim 1 , wherein the first recombination site is an 

2 attB site and the second recombination site is an attP site. 

1 5. The eukaryotic cell of claim 1, wherein the cell further comprises a first 

2 recombinase recombination site. 

1 6. The eukaryotic cell of claim 1 , wherein the cell comprises a nucleic acid 

2 that comprises a coding sequence for an recombinase polypeptide, which coding sequence is 

3 operably linked to a promoter that mediates expression of the recombinase-encoding 

4 polynucleotide in the eukaryotic cell. 

1 7. The eukaryotic cell of claim 6, wherein the nucleic acid further 

2 comprises a selectable marker. 
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1 8. The eukaryotic cell of claim 6, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 9. The eukaryotic cell of claim 8, wherein the nucleic acid is the plasmid 

2 pLT43. 

1 10. The eukaryotic cell of claim 1, wherein the eukaryotic cell is selected 

2 from the group consisting of an animal cell, a plant cell, a yeast cell, an insect cell and a 

3 fungal cell. 



1 



11. The eukaryotic cell of claim 10, wherein the eukaryotic cell is a 



a cell. 



1 1 2. The eukaryotic cell of claim 1 0, wherein the eukaryotic cell is present in 

2 a multicellular organism. 

1 1 3. A method for obtaining site-specific recombination in a eukaryotic cell, 

2 the method comprising: 

3 providing a eukaryotic cell that comprises a first recombination site and 

4 a second recombination site, which second recombination site can serve as a substrate for 

5 recombination with the first recombination site; 

6 contacting the first and the second recombination sites with a 

7 prokaryotic recombinase polypeptide, resulting in recombination between the recombination 

8 sites, thereby forming one or two hybrid recombination sites; 

9 wherein the recombinase polypeptide can mediate site-specific 

10 recombination between the first and second recombination sites, but cannot mediate 

1 1 recombination between two hybrid recombination sites in the absence of an additional factor 

12 that is not present in the eukaryotic cell. 

1 14. The method of claim 13, wherein the eukaryotic cell is selected from the 

2 group consisting of a yeast cell, a fungal cell, a plant cell, an insect cell and an animal cell. 
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1 15. The method of claim 13, wherein the first recombination site is present 

2 in a chromosome of the eukaryotic cell. 

1 16. The method of claim 15, wherein the second recombination site is 

2 present in a second chromosome of the eukaryotic cell and contacting the first and second 

3 recombination sites with the recombinase results in translocation of chromosome arms. 

1 1 7. The method of claim 1 3, wherein the first recombination site and the 

2 second recombination site are present on a single nucleic acid molecule. 

1 18. The method of claim 17, wherein the first recombination site and the 

2 second recombination site are in a direct orientation. 

1 1 9. The method of claim 1 8, wherein the recombination results in excision 

2 ofthe portion of the nucleic acid molecule that lies between the first and second 

3 recombination sites. 

1 20. The method of claim 17, wherein the first recombination site and the 

2 second recombination site are in an inverted orientation. 

1 2 1 . The method of claim 20, wherein the recombination results in inversion 

2 ofthe portion ofthe nucleic acid molecule that lies between the first and second 

3 recombination sites. 

1 22. The method of claim 1 3, wherein the eukaryotic cell comprises a 

2 polynucleotide that encodes the recombinase polypeptide. 

1 ,23. The method of claim 22, wherein the recombinase-encoding 

2 polynucleotide is operably linked to a promoter which mediates expression of the 

3 polynucleotide in the eukaryotic cell. 
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1 24. The method of claim 23, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 25. The method of claim 24, wherein the promoter is a Pmnt promoter. 

1 26. A method for obtaining a eukaryotic cell having a stably integrated 

2 transgene, the method comprising: 

3 introducing a nucleic acid into a eukaryotic cell that comprises a first 

4 recombination site, wherein the nucleic acid comprises a transgene and a second 

5 recombination site which can serve as a substrate for recombination with the first 

6 recombination site; and 



7 contacting the first and the second recombination sites with a 

8 prokaryotic recombinase polypeptide, wherein the recombinase polypeptide catalyzes 

9 . recombination between the first and second recombination sites, resulting in integration of 

1 0 the nucleic acid at the first recombination site, thereby forming a hybrid recombination site 

11 at each end of the nucleic acid; 

1 2 wherein the recombinase polypeptide can mediate site-specific 

1 3 recombination between the first and second recombination sites, but cannot mediate 

14 recombination between two hybrid recombination sites in the absence of an additional factor 

15 that is not present in the eukaryotic cell. 

1 27. The method of claim 26, wherein the recombinase polypeptide is 

2 selected from the group consisting of a bacteriophage 0C31 integrase, a coliphage P4 

3 recombinase, a Listeria phage recombinase, a bacteriophage R4 Sre recombinase, a CisA 

4 recombinase, an XisF recombinase, and a transposon Tn4451 TnpX recombinase. 

1 28. The method of claim 27, wherein the recombinase is a OC3 1 integrase. 

1 29. The method of claim 26, wherein the recombinase polypeptide is 

2 introduced into the eukaryotic cell by expression of a polynucleotide that encodes the 

3 recombinase polypeptide. 
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! 30. The method of claim 29, wherein the polynucleotide that encodes the 

2 recombinase polypeptide is operably linked to a promoter that functions in the eukaryotic 

3 cell. 

! 31. The method of claim 30, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 32. A nucleic acid that comprises a polynucleotide sequence that encodes a 

2 bacterial recombinase polypeptide operably linked to a promoter that functions in a 

3 eukaryotic cell, wherein the recombinase polypeptide cannot mediate recombination between 

4 two hybrid recombination sites that are formed upon recombination between a first 

5 recombination site and a second recombination site in the absence of an additional factor. 



1 33. The nucleic acid of claim 32, wherein the nucleic acid further comprises 

2 at least one recombination site that is recognized by the recombinase polypeptide. 

1 34. The nucleic acid of claim 32, wherein the nucleic acid comprises a 

2 plasmid vector. 

1 35. The nucleic acid of claim 34, wherein the vector is pLT43. 

! 36. A eukaryotic cell that comprises a polynucleotide that comprises a first 

2 bacteriophage <DC3 1 recombination site. 

1 37. The eukaryotic cell of claim 36, wherein the recombination site is 

2 selected from the group consisting of attP and attB. 

j 3 8 . The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a second polynucleotide that comprises a second 0>C3 1 recombination site that 

3 undergoes recombination with the first <DC3 1 recombination site when contacted with a 

4 <DC3 1 integrase polypeptide. 
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1 39. The eukaryotic cell of claim 38, wherein: 

2 the first recombination site is attB and the second recombination site is 

3 attP; or 

4 the first recombination site is attP and the second recombination site is 

5 attB. 

1 40. The eukaryotic cell of claim 38, wherein the second polynucleotide 

2 further comprises a transgene. 

1 41. The eukaryotic cell of claim 38, wherein the second polynucleotide 

2 further comprises a selectable marker. 

1 42. The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a OC3 1 integrase polypeptide. 

1 43. The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a nucleic acid that comprises a polynucleotide that encodes a <DC31 integrase 

3 polypeptide. 

1 44. The eukaryotic cell of claim 43 , wherein the nucleic acid further 

2 comprises a selectable marker. 

1 45. The eukaryotic cell of claim 43, wherein the nucleic acid further 

2 comprises a promoter which results in expression of the <DC3 1 integrase-encoding 

3 polynucleotide in the cell. 

1 46. The eukaryotic cell of claim 45, wherein the promoter is an inducible 

2 promoter. 

1 47. The eukaryotic cell of claim 36, wherein the eukaryotic cell is selected 

2 from the group consisting of a yeast cell, a fungal cell, a plant cell, and an animal cell. 
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Figure 2 

Transgene Integration in CHO Cell Line 
Site Specific Expression of GFP 
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Figure 3 

Transgene Integration in CHO Cell Line 
Hygromycin Resistance from attB x attP recombination 



pBSK-hpt(noattP) 
pFY17 (attP-90) 
P FY19(attP-50) ~~ 
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Pc = human cytomegalovirus promoter 




Ps = SV40 early promoter 


\ PP'| = attP 


zeo = zeomycin resistance coding region 


I BB'| = attB 


neo = neomycin resistance coding region 


1 PB'l = attR 


hpt = hygromycin resistance coding region 


[bFI = attL 


lacZ = beta galactosidase coding region 
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Excision of DNA from Tobacco Genome 
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Figure 5 

Integration of DNA into the tobacco genome 




[~FF1 = attP 
| BB't = a«B 
[TF1 = attR 
rBFI = attL 
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